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Abstract 

We introduce and study a class of infinite- horizon non-zero-sum non-cooperative stochas- 
tic games with infinitely many interacting agents using ideas of statistical mechanics. 
First we show, in the general case of asymmetric interactions, the existence of a strat- 
egy that allows any player to eliminate losses after a finite random time. In the special 
case of symmetric interactions, we also prove that, as time goes to infinity the game 
converges to a Nash equilibrium. Moreover, assuming that all agents adopt the same 
strategy, using arguments related to those leading to perfect simulation algorithms, 
spatial mixing and ergodicity are proved. In turn, ergodicity allows us to prove "fixa- 
tion" , i.e. that players will adopt a constant strategy after a finite time. The resulting 
dynamics is related to zero-temperature Glauber dynamics on random graphs of pos- 
sibly infinite volume. 
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1 Introduction 



The aim of this paper is to study a class of stochastic games with infinitely many interacting 
agents that is closely connected with a Glauber-type non-Markovian dynamics on random 
graphs. Let us briefly explain the setting and our contributions both from the point of 
view of game theory and of physics, referring to the next section for a precise construction 
of the model. Our central results are theorems [TJ [2] and [3] below. 

We consider an infinite number of agents located on the vertices of the two-dimensional 
lattice, where each agent is randomly linked with others, and has positive or negative 
feelings regarding them. Moreover, each agent is faced with the need of taking decisions 
that affect himself and all others to whom he is linked. The objective of each agent is 
to take (non-cooperative) decisions that ultimately do not affect him negatively. Under 
a specific choice of the payoff function of each player, we shall prove that there exists a 
decision policy achieving this goal, and even more, that if each player adopts this strategy 
a non-cooperative Nash equilibrium is reached. 

From the physical point of view, we study a Glauber-type dynamics on a random graph 
with the following features: the dynamics is non-Markovian and has long-range interactions, 
in the sense that the maximum distance between interacting particles is unbounded. For 
such dynamics we prove spatial mixing (hence ergodicity) and fixation. To the best of our 
knowledge, these problems are solved here for the first time, even in the simpler case of a 
standard Glauber dynamics on random graphs. Problems of dynamics on random graphs 
have attracted a lot of attention in recent years (see the monograph [4] for an extensive 
overview), as these structures are often more realistic models of several phenomena than 
classical deterministic structures (e.g. in network modeling, spread of epidemics, opinion 
formation, etc.). For instance, C. Cooper and A. M. Frieze [2] prove the existence of a 
critical coupling parameter at which the mixing time for the Swendsen-Wang process on 
random graphs of sufficiently high density is exponential in the number of vertices; M. Dyer 
and A. Frieze [5] study the rapid mixing (in time) of Glauber dynamics on random graphs 
with average degree satisfying a certain condition (see also A. Frieze and V. Juan [8] for 
a related result). In J. P. L. Hatchett et al. [TO], the authors analyze the dynamics of 
finitely connected disordered Ising spin models on random connectivity graph, focusing on 
the thermodynamic limit. I. Perez Castillo and N. S. Skantzos [T7j study the Hopfield 
model on a random graph in scaling regimes with finite average number of connections 
per neuron and spin dynamics as in the Little-Hopfield model. On the other hand, as 
mentioned above, even though (spatial) mixing is one of the most natural questions to ask 
about stochastic models of interacting particle systems, it has not been discussed in the 
literature, to the best of our knowledge. It is probably important to recall that mixing 
is a key ingredient to obtain further results, such as ergodicity. Moreover, just to cite 
another important application, using Stein's methods (see e.g. [lj), mixing implies the 
central limit theorem, which gives qualitative estimates on the number of sites (or agents) 
with a positive spin (or opinion) in large regions of the graph. 
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We would like to stress that our results on mixing are quite general, and if one is only 
interested in the physical aspect of our work, they could essentially skip the part of the 
paper which deals with stochastic games, and concentrate only on the physical aspect. 

Let us briefly discuss how the model and results of the present paper are related to 
the existing literature on using methods of the theory of interacting particle systems in 
economic modelling and game theory. One of the first and still most cited works on the 
subject is a paper by H. Follmer [6], who considered a pure exchange economy with (count- 
ably) infinitely many agents, each of which having random preferences and endowments. 
In particular, agents are located on the vertices of the d-dimensional lattice Z d , and their 
preferences can be influenced by all his neighbors (i.e. such that their euclidean distance is 
one). The author then considers the problem of existence of a price system stabilizing the 
economy. See also E. Nummelin |16j for further results in this connection, but with a finite 
number of agents. In U. Horst and J. Scheinkman |12j the authors study a system of social 
interactions where agents are located on the nodes of a subset of Z rf , and each of them is 
provided with a utility function and a set of feasible actions. The behavior of an agent 
is assumed to depend on the choices of other agents in a reference group, which can be 
random and unbounded. The authors, in analogy to our case, work under the assumption 
that the probability of two agents being linked decays with distance, and are concerned 
with the existence of equilibrium (in the classical microeconomic sense). U. Horst [TT] 
determines conditions such that non-zero-sum discounted stochastic games with agents in- 
teracting locally and weakly enough have a Nash equilibrium. While the set of feasible 
actions in this paper is much richer than in ours, we do not assume to have any knowledge 
on the reference group of each agent, apart of being finite almost surely. We only allow 
agents to be able to observe the dynamics of a (local) configuration around them. As a 
result of the structural differences in the settings, the optimal strategy in [11] is Markovian, 
while in our case it can never be Markovian. 

In general, the following features of our setting and results could be particularly in- 
teresting from a game-theoretic perspective: we consider games where interactions among 
agents are not known a priori, and we explicitly construct a strategy that leads the game 
to equilibrium, while the typical result of game theory is the existence of equilibrium and 
a characterization of optimal strategies at equilibrium. 

Let us also briefly recall that several other models of interacting particle systems admit 
a natural interpretation in terms of social interaction. Well-known examples are the voter 
process (see e.g. T. Liggett p2]), used in models for the formation and spread of opinions, 
or the Sherrington-Kirkpatrick model of spin glasses (see e.g. section 2.1 in M. Talagrand 
|18j). Infinite interacting particle systems have found applications in sociology as well: see, 
for instance, T. Liggett and S. Rolles [H] for a model of formation of social networks. 

The organization of the paper is as follows: in section [2] we describe the model so we 
show how agents interact and what their aim is; in section 3 a general strategy achieving 
the goal of each agent is given, and section 4 proves spatial mixing, hence ergodicity, of the 
dynamics, when all agents adopt the same strategy. Finally, using the results on spatial 
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mixing and ergodicity, we prove that the game "fixates", i.e. that agents will adopt a 
constant strategy after a finite time. This phenomenon is reminiscent of the fixation of 
zero-temperature dynamics (see e.g. E. De Santis and C. M. Newman [3J, L. R. Fontes [7J, 
O. Haggstrom [9j, S. Nanda, C. M. Newman and D. L. Stein [l5j). 

2 Model and problem formulation 

Let us first introduce some notation used throughout the paper. We consider the two- 
dimensional lattice Z 2 with sites x = (xx,X2) and distance d defined by 

d{x,y) = \xx-yx\ + \x 2 -y2\. (1) 

The cardinality of a subset r C Z 2 is denoted by |T|. We denote by Am the set of all 
x £ Z 2 such that d(0,x) < M, with O = (0,0). If x € Z 2 , A M (x) stands for A M + x. 
Our configuration space is S = { — l,+l} z2 . The single spin space { — 1,+1} is endowed 
with the discrete topology, and S with the corresponding product topology. Given 77 G S, 
or equivalently 77 : Z 2 — > {— 1,+1}, and A C Z 2 , we denote by 77A the restriction of 77 
to A. Given a graph G = (V, E), where V and £ are the sets of its vertices and edges, 
respectively, we shall denote by {x, y} an element of E connecting x, y £ V. For any 
x £ V, we shall denote by p x the distance of the longest edge having x as endpoint, namely 
we define 

p x = sup d(x,y). 

y. {x,y}eE 

Recall that the distance in variation of two probability measures \x and v on a discrete set 
$7 is defined as ^ 

IIm-^II = 2 \^ - 

We shall now introduce an idealized model of a large ensemble of interacting individuals. 
The ingredients will be a random graph, a function on its edges (specifying an environment, 
roughly speaking), and a stochastic process with values in S describing the time evolution 
of the system. 

Let Q = (V, E) be a random graph, whose set of vertices V is given by all sites of the 
2-dimensional lattice Z 2 , and whose set of edges E satisfy the following conditions: edges 
exist with probability one between each site x and all y such that d(x,y) = 1, and 

P(|{y : {x,y} £ E}\ < 00) = 1 Vx G V. (2) 

We suppose that each site is occupied by an individual (we shall often identify individuals 
with the sites they occupy, when no confusion will arise) , and that relations among individ- 
uals are modeled by the edges of Q and by a function j : V x V — ► {—1, 0, +1}; j(x, y) = 
if {x, y} £ E, otherwise j(x, y) £ {—1, +!}• 
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In particular, we shall say that individuals x and y are linked if {x, y} G E, and the 
value j(x,y) shall account for the "feelings" of x towards y: we set j(x,y) = +1 if x is a 
"friend" of y, and j(x,y) = —1 if x is an "enemy" of y. We do not assume symmetry of 
j, i.e. friendship of an individual towards another may not be reciprocal. Moreover, we 
assume that individuals do not know with whom they are connected, nor whether these 
individuals are friends or enemies. Note also that in this model x can be friend of y, y friend 
of z, but x and z can be either friends or enemies (a phenomenon also called frustration in 
physics) . 

Let us now introduce a stochastic process a : [0, oo) — > S modelling the evolution of 
the "action" (or opinion) of the individuals. We shall use a graphical construction of the 
process, which provides a specific version of basic coupling, i.e. it provides versions of the 
whole family of stochastic processes on Q (or on any finite subset of it), all on the same 
probability space. We assume that the initial configuration <7o is chosen from a symmetric 
Bernoulli product measure. Moreover, the continuous-time dynamics of at is given by 
independent Poisson processes (with rate 1) at each site x € V corresponding to those 
times (t x ,n)nGN when the individual x is asked to update his opinion. Before describing the 
set of feasible ways of opinion updating, let us introduce a reward for a generic individual 
x at time t x>n , as a result of his action: 



where we have set, for simplicity, t — t xn . 

We allow x to base his decision on the history of 0"s s (s), s > 0, and h(x), where E s 
are finite balls centered in x with random radius which is nondecreasing with respect to s, 
finite almost surely for all s > 0, and not 'exploding'. Formally, the decision of individual 
x at time t xn is a { — 1, +l}-vahied random variable u Xy n 

measurable with respect to the a- 
algebra generated by {cs s (s), s < t x>n } and {h s (x), s < t X;7l }, where H s are balls centered 
in x such that 



exists and is finite with probability one. We shall denote by £f the filtration just defined. 
The dynamics of a is then completely specified by the updating rule 



Several remarks are in order: the reward ht{x) obtained by individual x as a result of his 
decision at time t = t X:7l is positive if the difference between pleased and damaged friends is 
bigger than the difference between pleased and damaged enemies, negative if the opposite 
happens, and zero if the value is the same. Since at a fixed arrival time t = t Xyn of the 
Poisson clock of x no other clock is ringing, i.e. P(t J/jm = t) = for all y ^ x and for all 





°t x , n (x) = U X:n . 
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m G N, the dynamics of a is well-defined (also using the graphical construction). Finally, 
at any positive time t, at(x) represents the last decision taken by individual x up to time 
t. 

We formulate the following problem for the generic individual x: find a strategy ir x = 
{u x ,i,u x ,2, ■ ■ •) such that 

ht{x) > a.s. 
for all t >T X , where T x is a finite (random) time. 

Remark 1. We built the random graph Q on the two-dimensional lattice 1? to give a 
"geographic" dimension to the problem and to have a simple notion of distance on the 
graph. However, all results in the next section still hold replacing 1? with any higher 
dimensional lattice Z d , d > 3. We shall see below that choosing d = 2 also affects a 
constant appearing in an assumption used to prove spatial mixing. 

3 Admissible strategies that eliminate losses 

In this section we construct explicitly a strategy -k x for the generic individual x that asymp- 
totically eliminate negative rewards, i.e. such that F(ht(x) > 0) = 1 for all t greater than 
a random time, which is finite with probability one. It will also be clear that this strategy 
is non-cooperative, that is n x eventually eliminate negative rewards irrespectively of the 
strategies adopted by all other individuals. 

For simplicity of notation let us describe the strategy ir = ttq for the individual located 
at the origin O. The arrival times of his Poisson process and the corresponding decisions 
and rewards will be denoted by t n , u n , and h n , n G N, respectively. 

The strategy it = (ui,v,2, ■ ■ ■) is best defined algorithmically through a decision tree. 
We also need an additional "data structure" , i.e. a collection 1Z of ordered triples of the 
type (rj,u,h), where rj G S is supported on finite balls, u £ {— 1,+1}, and h £ {— 1,0, +1}. 

At the first arrival time t\, u\ is chosen accordingly to a Bernoulli law with parameter 
1/2 (a "fair coin toss"), and {(T\ x , u±, h\) is added to TZ. 

The description of the algorithm then follows inductively: at time t n +i> l e t h-M n be the 
support of the last configuration added to 1Z. Let a' := &A Mn (t n+ i—) and check whether 
there exists (a',u',h') £ 1Z. 

• If yes, set 

,h! 

u n+1 = u Wy 

with the convention 0/|0| := 1. The reward h n+ i corresponding to u n+ \ is now 
obtained. 

— If h n+ \ > 0, no further action is needed. 

— If h n+ \ < 0, then add to 1Z the triplet (a\ Mn+1 ,u n+ i, h n+ \). 
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• Otherwise, set u n+ \ = u n , and add to 1Z the triplet (a',u n+ i, h n+ i). 

The above algorithm formalizes the following heuristic procedure: the agent starts looking 
at the configuration on the smallest ball centered around him and plays tossing a coin. 
The next time his clock rings, he checks whether he has already seen such a configuration. 
If it is a new one, he will again memorize it and play by tossing a coin, while if it is a 
known one he will play as he did before if he got a positive reward, or the opposite way if 
he got a negative reward. Of course it could happen that this way of playing still does not 
guarantee a positive reward, in which case he will memorize the configuration on a larger 
ball around himself and its associated outcome. 

Remark 2. One of the key steps of the algorithm requires one to look for a triplet (a 1 , u',h') 
in 1Z, given a' = cja, for a certain Ac2 2 . This operation is uniquely determined, i.e. there 
can exist only one triplet (a', u', h!) G 1Z with a given a' . This can be seen as a consequence 
of the structure of the algorithm itself. Namely, as soon as the player "observes" the same 
configuration a' = o"a with a different associated outcome h, he will immediately enlarge 
the support of observed configurations A. 

We shall now prove that the strategy just defined eliminates losses for large times. 

Theorem 1. For any individual x there exists a random time T x , finite with probability 
one, such that 

^{h tvTx (x)>0j =1. 

Proof. Let us define a sequence of random times (r n ) n€ N as follows: 

r n = inf{n G N | 3(cF, u, h) G 7Z, suppa = A n+ i}, 

with the convention that inf = +oo. In other words, r n is the first time that individual x 
includes into his information set 1Z the box A n+ i (and if this never happens, then r n = +oo) . 
Let Tk be the last finite element of the sequence (r n ) ng N- By assumption ([2]) we know that 
\{y : {x,y} G E}\ is finite, hence k < p x because h t (x) only depends on those y linked to 
x, for all times t. Therefore the biggest A n observed by the agent in the origin is finite. 
Define the family of sets 

A k (t) = {a Ak (t x/ ) : t x/ G [r k ,t]} . 
It clearly holds A k (ti) C A^fo) for t\ < t2, hence we can define 

A k (oo) = lim A k (t). 

f\oo 

Since A k (t) C {— l,+l} Afc , and A^ is finite, then there exists T x > such that A k (T x ) = 
A k (oo), hence 

A k {t) = A k (x) Vt>T x . 
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We claim that h x>n > for all t x n > T x . In fact, for every t XjTl > T x there exists (a, u, h) £ 
1Z with a(y) = cr txn (y) for all y £ A&. But since Tk+i = oo, the algorithm will give as 
output a u n such that h n > (to convince oneself it is enough to "run" the algorithm). In 
a more suggestive way, one could say that after T x individual x has already been faced at 
least once with all possible configurations that are relevant for him, and therefore knows 
how to take the right decision. □ 

Remark 3. (i) Note that the strategies of other individuals never enter into the arguments 
used in the proof. Therefore individual x is sure to reach the goal of eliminating losses in 
finite time irrespectively of the strategies played by all other individuals. 

(ii) However, we would like to stress that the random time T x is not a stopping time 
(i.e. it is not adapted to the filtration £f). In fact, T x depends in general on the decisions 
of other individuals, whose policies are not necessarily adapted to £f. In general, even if 
all policies were adapted, the random times {T x } x&1 2 would not be stopping times. 

(iii) Let us also observe that although we formally allowed the strategy tt x to be adapted 
to £f , the information used by the strategy constructed in the proof of Theorem Q] is 
much smaller. Similarly, one could refine the way the memory structure 1Z is constructed, 
for instance by eliminating configurations on smaller balls, when one starts to add new 
configurations on balls of higher radius. However, we preferred to keep the construction of 
1Z as it is to avoid non-essential complications. 

As a consequence of theorem Q] and of observation (i) in the above remark, one has 
the following result, which essentially states that the games admits an "asymptotic" Nash 
equilibrium. 

Proposition 1. Let M G N and assume that each player x S Am := [—M,M] x [—M,M] 
adopts the strategy tt x defined above. Then there exists a finite random time Tm after which 
no agent can gain by any change in their strategy given the strategies currently pursued by 
other players. 

It is important to observe that in the above proposition we implicitly assume that 
each player only cares about "not loosing", or equivalently he distinguishes only between 
"loosing" (h t (x) < 0) and "not loosing" (h t (x) > 0). In this sense, after Tm, there is no 
point for any player x E Am to change his strategy, as proved in theorem [TJ The statement 
of the proposition is in general false if the player distinguishes between ht(x) > 0, ht(x) = 0, 
and h t {x) < 0. 

We think that one can prove (and we leave it as a conjecture), that this asymptotic 
equilibrium is not Pareto. This could be done adapting ideas of O. Haggstrom [9], who 
proved that zero-temperature dynamics on a random graph does not reach the minimum 
energy configuration. 
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4 Spatial mixing and ergodicity 



The main result of this section, which plays an essential role in the results about fixation 
of the next section, is that a spatial mixing property holds. We shall work under the 
following hypothesis, which states that the probability of two agents being linked decays 
algebraically with their distance. 

Standing assumption. It holds that 

for all y such that d(x,y) > 1, where C is a positive constant. 

Note that assumption ([3]) implies d5J. Moreover, the exponent appearing on the right-hand 
side of ([3j) depends on the dimension of the lattice and it is needed in order to use well- 
known combinatorial estimates on path counting in Z 2 in the proofs to follow. However, 
it would not be difficult to generalize our arguments to any higher dimensional lattice Z rf , 
d > 3, at the expense of replacing the exponent 9 with a (higher) constant depending on 
the dimension d, and of using more complicated estimates in the proofs. Since this point 
is not essential and would only add technical complications, we preferred to fix d = 2. 

Before stating the main theorem of this section, we need to introduce the following set 
of conditions. 

Hypothesis H. The random graph Q = (V,E) and the process a : [0, oo) — > {— l,l} z2 , 
satisfy the following conditions: 

(i) For each vertex x G Z 2 there exists a Poisson process P x , and the Poisson processes 
{P x }x€Z 2 are mutually independent. Denoting by T x = {t xn } the set of arrival times 
of P x , the value of at{x) is allowed to change only at times t G T x . 

(ii) Given any couple (x,y) G Z 2 x Z 2 , the probability P({x,y} 6 E) is defined and it 
can depend on d(x,y). Moreover, for any choice of (x,y), (v,w) G Z 2 x Z 2 with 
(x,y) / (v,w), the events {x,y} G E and {v,w} G E are mutually independent. 

(Hi) The evolution of the process is local, i.e. o- tx n {x) is measurable with respect to , 
where 3? denotes the a -algebra generated by {a s {y) : {x,y} G E or y = x, s < t}. 
We denote by J-'Y the a -algebra generated by U^gySf . 

(iv) Both the probability of two agents being linked and the evolution of a are translation 
invariant, i.e. F({x,y} G E) = F({x + v, y + v} G E) and¥(a t e A\a = v) = ^( a t G 
A + v\a {-) = n( + v)). 

We can now state the main theorem of this section. 
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Theorem 2. If Hypothesis H holds true, then a satisfies the spatial mixing property 

lim F(a Ao (t) = r)\F t AC ) = ¥(a Ao (t) = r,), (4) 

where Aq is any finite region in 1? . 

Note that the process a is translation invariant if each agent adopts the same strategy 
at each decision time (the strategy does not need to be the one defined in section 3). Before 
giving the proof of the theorem, we establish some auxiliary results. 

We shall use the following terminology: by "box of side length L" we mean the set 
[-L/2,L/2] 2 C I 2 . For p < 1, we call "subbox of side length L p " any one of the L l ~ p 
square sets into which a box of side length L can be subdivided. We always assume L p , 
L l ~ p G N (without loss of generality, as it will be clear). Furthermore, we shall say that 
two subboxes R and S are "neighbors" if d(R, S) < y/2, so every subbox has 8 neighbor 
subboxes. We shall call "path of subboxes" a sequence of subboxes {Rk)k=i,...,K such that 
Rk and Rk+i are neighbors for each k = 1, . . . ,K — 1. Two subboxes R, S are "linked" if 
there exist x G R, y G S such that {x, y} £ E. 

In the following lemma we introduce a sequence of boxes increasing to Z 2 , each of one 
further subdivided into a variable number of boxes also increasing to Z 2 , but at a lower 
rate. 

Lemma 1. There exist a sequence of integer numbers L n j +oo, a sequence of square boxes 
Ql„ °f side length L n , each of them partitioned into subboxes of side L P U p = 13/42 ; such 
that only a finite number of the boxes Qh n will contain linked non-neighbor subboxes. 

Proof. We use a Borel-Cantelli argument on a suitable sequence of box side lengths L n . In 
particular, let L be a positive integer, Ql a square of side L, subdivided into subboxes of 
side L p . The probability of an agent x to be linked with some other agent of a non-neighbor 
subbox is bounded by 

C f°° 1 1 

£ d(x-^- Ci L v^ vdv=c ^ 

y:d(x,y)>LP V ' y > J LP 

where C, C\, C2 are positive constants. Therefore two agents in non-neighbor subboxes 
exist with probability not larger than 

y.d(x,y)>LP y ,yj 

Taking now a subsequence L n growing to infinity rapidly enough, 

J>(A L J<oo, 

where Al„ denotes the event that Ql u contains linked non-neighbor subboxes. By Borel- 
Cantelli lemma, only a finite number of occurrences of A^ n can happen, which finishes the 
proof. □ 
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Recall that for a sequence of i.i.d. standard exponential random variables {X{\ one has 

n 

P ( 2 Xl < na ) - e ~ Ha)n Va < EXi, (5) 

i=l 

where the so-called rate function <3? is given by 

$(a) = a — 1 — log a. 
Proof of Theorem^ We use a coupling argument to show that 

sup |P(ffAo(*) = »7kA«(0) = CO " PK(t) = »7kAc(0) = C")| - (6) 
C',C" 

and hence, by the inequality 

sup |P(a-Ao(t) = r]\a A c(0) = (') - F{a Ao (t) = r/|<7 A e(0) = C")| 
C',C" 

> |P(ff Ao (i) = VMO) = " n<T Ao (t) = T])\ VC, 

that flU) holds. 

We construct two coupled systems a', a" on the same probability space supporting a in 
the following way: <r' x (0) = 0^(0) = cr x (0) for all x G A; o 7 and a" update their state 
according to the same translation-invariant rule of a; all other randomness in the system 
(the random graph, the Poisson processes, the "coin tosses" needed for the decision rules) 
coincide. Define, for any x G V, the random time 

r x = inf{i > : a' x {t) + o%(t)}, 

and introduce the process 

[0,oo) x V 3 (t,x) ^ u x {t) = l(t>T x ) G {0,1}. 

Using a pictorial language, we shall say that we color x with black as soon as the two 
processes a' and a" differ at x. Let us also introduce another process v : [0, oo) xF -* {0, 1} 
with the property v x {t) > v x {t) a.s. for all x and all t. The dynamics of v is specified as 
follows: P x (0) = for all x G V, and v x can turn to one as a consequence of two classes of 
events. In particular, (i) u x {t) = 1 if there exists x' belonging to the same subbox of x such 
that v x t{t) = 1, and (ii) v x {j) = 1 if there exists y belonging to a neighbor subbox such 
that v y (j) = 1, where r is any arrival time of the Poisson process relative to x. Moreover, 
we assume that 1 is an absorbing state for D x , for all x. Using again a pictorial analogy, 
we could say that the black area generated by v is bigger than the black area generated 
by v. In particular, as soon as a site x turns black, (i) implies that the whole subbox to 
which it belongs becomes black as well. 
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By Lemma [H there exists a positive integer TV and a sequence L n such that for all n > N 
the boxes Qi n contain no linked non-neighbor subboxes. The shortest path of subboxes 
from the boundary of the box Ql„ to its center has length L n p /2 (therefore, for n large 
enough, the shortest path of subboxes from the boundary of the box Ql„ to Ao has length 
greater or equal than L 1 n T p /2 — 1). Setting Tq lp = vcii x& Q LP t Xt \ (recall that t Xt \ is the 
first arrival time of the Poisson processes relative to x), one has that the distribution of 
Tq lp is Exp(L 2p ), where Exp(A) stands for the law of an exponential random variable with 
parameter A. The minimum time for the formation of a path of k "black" subboxes along 
a fixed path (of sites) from the boundary of Ql„ to the origin is given by 

k 

i=i 

for all n > N (from now we shall tacitly assume n > N), where Ti,...,Tfc are i.i.d. 
exponential random variables with parameter L n p (independence and the value L n p follow 
by the memoryless property of the exponential distribution). 

Note that the sequence of subboxes in a path turning black does not influence the 
minimum time needed for the formation of such path, which is a sum of independent 
exponential random variables of parameter L n p , using again the memoryless property of 
exponential distributions. It follows by © that, for < a < 1, one has 

P(T < kaL- 2p ) < e -(a-i-log«)fc, 

Denoting by Tqq l _>o the (random) time needed to form a path of black subboxes from 
the boundary of Qh n to the origin O, we obtain the estimate 

^(T dQh ^o < f L n - 3p ) < 4L n - p 8-7 fc - 1 exp(-(a-l-loga)fc), 

k>^ 

hence, for < a < j^, 

f2 mP(r« 3 ^ o <fL^)=0. 

Here the term \L X n p accounts for the possible initial subbox on the boundary of Ql, and 
8 • 7 fc_1 is an upper bound for the number of paths (of subboxes) of length k starting in a 
given subbox. We obtain that, as n — » oo, the term on the right hand side goes to zero like 
e -/3L„ p ( mocm i polynomial terms), with (3 a positive constant. Again by a Borel-Cantelli 
argument we obtain 

P( lim T 9Ql ^ = oo) = 1. 

Moreover, the evolution of the central subbox is completely independent on the configura- 
tion outside A until it turns black, and so the theorem is proved. □ 
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Remark 4. Although ([6]) has been proved only for a particular choice of a sequence of 
increasing boxes A n , one can easily show that any increasing sequence of boxes will do. In 
fact, the supremum appearing in Q is decreasing with respect to A, hence it is enough to 
prove the theorem for any (fixed) subsequence. 

5 Fixation 

In this section we shall work under the general assumptions introduced in section 2 and 4, 
and furthermore we assume that each player adopts the same strategy (hence the dynamics 
is translation invariant), and that interactions are symmetric, i.e. that j(x,y) = j(y,x) 
for any x, y £ V. The latter hypothesis is essential, as it would be possible to find 
counterexamples to our results in the case of asymmetric interactions. As before, we shall 
denote by x an arbitrary agent, fixed throughout this section. Let us define the random 
time T x as 

T x = sup{t : at time t agent x sees a new configuration or loses}. (7) 

As it follows from Theorem [TJ T x is finite with probability one. Moreover, by definition, 
agent x will not loose at any time after T x . Let us also define the random variable M x as 
the number of times agent x changes his state (i.e. updates his opinion) during the time 
interval (0, +oo). 

The main result of this section is the following: 

Theorem 3. Assume that each agent adopts the strategy constructed in section 3. Then 
each agent x £ V updates his opinion only a finite number of times, i. e. 

F(M X < oo) = 1. 

Before proving theorem [3l we shall need some more definitions and preparatory results. 
Let us recall the definition of p x : 

p x = sup d(x,y), (8) 

y. {x,y}£E 

the distance from x of his farthest connected agent. Note that one has, as follows by the 
standing assumption (J3j) , 

s>r 

where C and K are constants depending on x. Therefore ^p x , 1 < k < 5 are finite: 

oo oo oo „ 

^p x < i + rk¥ (p = r )< i + Yl rk ^p - r ) - 1 + rk -&Te < °°- ( 10 ) 

r=2 r=2 r=2 T 
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Let us also define the energy (or Lyapunov) function on a finite set A C Z as 

H A (a) = -Y,hu(cr), (11) 

where 

~K{<?)= ^2 j{u,v)a u a v . (12) 

v:{u,v}£E 

In the following we shall denote by A n the square box [— n, n] x [— n, n]. 
Lemma 2. There exists a continuous function e : M+ — ► [— Kp 2 ,Kp 2 ] such that 

lim gA >, (t)) = e(t) a.s. (13) 
n— >oc |A n | 

Proof. By the definitions of h x (a(t)), p x , it follows that for each time t 

< M<K*)) < Pi 

hence, taking expectations, recalling (|10p . and using translation invariance 

-oo < -Ep 2 < Eh x (a(t)) < Ep 2 Q < oo. 

At any time i, using the space ergodicity of the system (implied by the spatial mixing 
property proved in Theorem [2|), we obtain 

lim HA ;[ <Ti : t ^ = WkoMt)) a.s. (14) 

rwoo |A n | 

Setting e{t) = Kho(c r (t)), we just have to prove that e is continuous. Using again the 
spatial ergodicity of a, the proportion of agents in A„ taking at least a decision in the time 
interval }h,t 2 [ tends to 1 - e"^' 1 ) < t 2 - h as n — > oo. Since each agent is endowed 
with a Poisson process that is independent from all other processes and random variables 
describing the dynamics of the system, the mean energy variation of each agent is bounded 
by Kp 2 . Therefore we also have 

\e( h )-e( tl )\ = lim Z E^Ml 

n -^°° | A n | (15) 

< (1 - e^-^Ep 2 < (t 2 - t^Ep 2 , 
i.e. the function e is Lipschitz continuous. □ 
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Let us now define the following discrete random sets for agent x, which are subsets of 
the set of arrival times of his Poisson process: 

Ni(x) = {t : t < T x ,the agent in x sees a known configuration at time t and loses} 
^(x) = {t : t < T x , there is an arrival of the Poisson process in x} \ Ni(x) 
Ns(x) = {t : t > T^the agent in x changes opinion} 

Note that by definition of T x , at any time t > T x agent x can only see known configurations, 
and can only win. 

We also define, for every t > and x E Z 2 , the random sets 

Ni(t,x) = Ni(x)n[0,t], 

for i = 1, 2, 3. 

Moreover, for A C Z 2 , we set 

JVi(i,A) = |J Ni{%x). 

The dynamics of the system and the definition of e{t) imply that e(t) is determined only 
by the changes of ov(-), r G {Ni(t, x)} xe %2, i = 1,2,3. We can therefore write 

e(t) = e 1 (t) + e 2 (t) + e 3 (t), 

where ei(t) denotes the component of e(t) determined by changes of o>(-) for r G {Ni(t, x)} x( zj2 . 
Moreover one has e2(t) < because we are eliminating the arrivals where the agent lost, 
and in this case the energy can only decrease. 

We are now in the position to prove the theorem on the fixation of the stochastic 
dynamics. 

Proof of Theorem In virtue of the translation invariance of the system, it is enough to 
prove the result for the agent in the origin. First observe that Mo < |iVi(0)| + 1^(0) | + 
| A^3 (0)| , because Ni(0) and A^O) may contain Poisson arrival times in which the agent 
O does not change his opinion. Denoting N{{0) by Aj for simplicity of notation, we shall 
prove that |Aj| < oo almost surely for i = 1,2,3. Let us first observe that the following 
inclusion relations hold: 

{|iVi(0)| + |AT 2 (0)| = oo} C {T = oo} U (|J {N x {0) = oo, T < n}), 

n>l 

and 

{1^(0)1 + 1^(0)1=00, T <n}c 

{[Arrivals in (0, n) of the Poisson process in the origin| = oo, Tq < n}. 
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Recalling that P({To = oo}) = we obtain 
P(\N 1 (0)\ + \N 2 (0)\=^) < 

oo 

P( | Arrivals in (0, n) of the Poisson process in the origin| = oo) = 0. 

n=l 

Thus we only need to show that \Ns(0) | is almost surely finite. First we observe that one 
has 

e x {t) = Hm |^-r E H An (a(r))-H An (a(r-))<Kpl a.s, (16) 

because the number of changes in the origin N\(t, O) is at most po (the maximum number 
of enlargements of the box observed by the agent x), and in any change the energy can 
increase at most by p Q . Finally, the spatial ergodicity yields the almost sure upper bound 
in (US]). 

At any time r £ A3 the energy H\(a(t)) decreases at least of one unit, i.e. H\(a(t)) < 
H\(a(t~)) — 1, otherwise the agent does not change opinion. Thus 

e 3 (t) = lim-^- £ H An (a(r)) - H An (a(r-)) 

n^oo A„ * — ' 

1 reNaitAn) (17) 

< hm zj^M = - E | jys ( t , 0) |, 
n— >oo |A n | 

where we have used once more the spatial ergodicity. 

By Lemma [2] and noting that the energy is initially zero (because agents choose +1 or 
— 1 with probability 1/2), one has the following inequality 

-Ep 2 < e(t) = ei(i) + e 2 (t) + e 3 (t) < ei(t) + e 3 (i), 

which holds uniformly in time t. Using inequalities (|16p and (|17p we obtain E|A^3(i,0)| < 
E/3q + < 00 uniformly in t, hence also in the limit as t — > 00. But E|A' r 3(0)| < cxd 
obviously implies jA^O)) < cxd a.s., so we have shown that Mo < \Nx(0)\ + |JV2(0)| + 
I JV3 (O) I < 00 and the proof is complete. □ 

Remark 5. We can also deduce, following the proof of Theorem [3l that 

P{N3 > o < Em±Em, 

as an immediate consequence of Markov's inequality. 
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Remark 6. Let us briefly comment on the connection between the fixation result just 
proved and the results of De Santis and Newman [3]. The improvement is twofold: namely, 
the dynamics considered here does not coincide (locally) with zero-temperature dynamics. 
It is immediate to prove that at any given time there is at least an agent which does 
not follow the zero-temperature dynamics. This implies that on any time interval the zero- 
temperature dynamics and our dynamics are almost surely different. Our could say, perhaps 
somewhat informally, that our dynamics is a perturbation of zero-temperature dynamics 
with the property of preserving fixation. Moreover, as already mentioned several times, 
our dynamics is non-Markovian, while the arguments used in [3] hold only for Markovian 
dynamics. 
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